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Earth Science Informatics Comes of Age 


I. THE EMERGENCE OF 
EARTH SCIENCE INFORMATICS 

T he volume and complexity of Earth science data have 
steadily increased, placing ever-greater demands on 
researchers, software developers and data managers 
tasked with handling such data. Additional demands 
arise from requirements being levied by funding agen- 
cies and governments to better manage, preserve and 
provide open access to data. Fortunately, over the past 
10-15 years significant advances in information tech- 
nology, such as increased processing power, advanced 
programming languages, more sophisticated and practi- 
cal standards, and near-ubiquitous internet access have 
made the jobs of those acquiring, processing, distribut- 
ing and archiving data easier. These advances have also 
led to an increasing number of individuals entering 
the field of informatics as it applies to Geoscience and 
Remote Sensing. Informatics is the science and technol- 
ogy of applying computers and computational methods 
to the systematic analysis, management, interchange, 
and representation of data, information, and knowl- 
edge. Informatics also encompasses the use of comput- 
ers and computational methods to support decision- 
making and other applications for societal benefits. 

II. THE GRSS ESI TC 

The mission of the IEEE GRSS is "to advance science and 
technology in geoscience, remote sensing and related 
fields..." with the society's fields of interest being "the 
theory, concepts, and techniques of science and engi- 
neering as they apply to the remote sensing of the Earth, 
oceans, atmosphere, and space, as well as the process- 
ing, interpretation and dissemination of this informa- 
tion." Both the mission statement and the fields of inter- 
est of the IEEE GRSS clearly encompass Earth Science 
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Informatics (ESI). A large number of IEEE GRSS mem- 
bers work in the ESI area and at each IGARSS there are 
ESI related regular and invited sessions on topics such 
as GIS, semantic web, data provenance, sensor web, 
GEOSS, standards, data processing, data management, 
and decision support. However, until recently GRSS had 
yet to set up an ESI related technical committee. 

Given the rapid growth in informatics, GRSS decided 
to expand the original mission of its existing Data 
Archiving and Distribution Technical Committee (the 
DAD TC) and rename it the Earth Science Informatics 
Technical Committee (ESI TC), focusing on advancing 
the application of informatics to the geosciences and 
remote sensing. 

By establishing an ESI Technical Committee at GRSS 
we provide a home to GRSS ESI professionals, enabling 
them to exchange information and knowledge while 
setting a research agenda and making GRSS more vis- 
ible in the broader ESI community. We aim to provide 
technology advice to major national and international 
ESI initiatives. An ESI TC also helps GRSS attract more 
ESI professionals to the GRSS. 

III. THE KNOWLEDGE GENERATION LIFECYCLE 

The scope of the ESI TC can be better understood by 
considering the knowledge generation lifecycle, shown 
schematically at a high level in Figure 1. This lifecycle 
depicts the sequence of processes involved in knowledge 
generation and is useful in identifying where data and 
information can be enhanced or even lost. Standards play 
important roles at each stage of the knowledge generation 
lifecycle and some relevant categories of standards are 
listed at each stage to illustrate this fact. 

The scope of the original DAD TC was essentially 
limited to the data lifecycle, shown by the inner cycle 
of Figure 1. The data lifecycle is part of the more com- 
prehensive knowledge generation lifecycle, and could be 
said to underpin it. 
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FIGURE l. The Research Knowledge Generation Lifecycle. The inner cycle is the foundational data 
lifecycle, which is an integral aspect of the outer knowledge generation lifecycle. Example categories 
of standards that apply in each phase of the knowledge management lifecycle are shown. 


In the Design and Plan phase of the lifecycle it is impor- 
tant to consider how data will be acquired, evaluated, 
transferred, stored and documented. These activities are 
best captured in a data management plan, which is now a 
requirement of awards made by many agencies. While vari- 
ous agencies and organizations have developed guidelines 
and templates for writing data management plans, there 
has yet to be developed an international standard for this. 

A reference architecture can be helpful in designing the 
systems that will realize project goals in a way that makes the 
components and interfaces of that system more reusable and 
interoperable with other systems. Reference architectures rep- 
resent abstract solutions implementing the concepts and rela- 
tionships identified in a reference model, for which there are 
several standards such as OSI [1], OAIS [2] and RM-ODP [3]. 

Research projects often Collect Data from a suite of 
sensors which must be controlled, calibrated and moni- 
tored. Traceability to reference standards is a fundamental 
requirement for producing accurate and reliable data. There 
are also information standards specifying how to calibrate 
and document instrument performance. 

The Process Data phase of the lifecycle includes the many 
steps needed to harmonize and integrate data streams and 
otherwise prepare it for analysis. Conformance to standards 


can be of great benefit in this 
task. Standards are also useful 
in applying and documenting 
the outcomes of quality assur- 
ance steps. 

The ability to apply tools 
and algorithms in the Analyze 
Data phase is enhanced by 
the use of standards, such as 
for encoding, geospatial ref- 
erencing and portrayal. The 
reuse of software and proce- 
dures is also facilitated by the 
use of standards. 

It is coming to be recog- 
nized that it is important to 
preserve all the outputs of the 
research process, not just pub- 
lications. Reproducibility and 
traceability demand that the 
data behind the publication 
be documented, preserved 
and made available. Placing 
data into a trusted repository, 
assigning persistent identifiers 
to data and referring to those 
PIDs in the publications is now 
considered an essential part of 
the Publish Results phase. 

Finally, data must be dis- 
coverable and accessible so 
that future research can build 
upon those results. The traditional approach to Discovery 
and Reuse, i.e. placing the data in an archive and populat- 
ing a metadata catalog, is being extended through linked 
data and semantic technologies. Of particular importance is 
the ability for data to be used by disciplines and in contexts 
other than those in which the data were generated. Media- 
tion and brokering technologies are beginning to be applied 
to meet this challenge [4]. 

IV. STANDARDS DEVELOPMENT AND USAGE 

One of key elements of the ESI TC mission is to help develop 
and employ standards and best practices that are needed to 
make both data and data systems usable and interoperable. 
The GRSS ESI TC is pursuing this objective through partici- 
pation in, and collaboration with the Open Geospatial Con- 
sortium, OGC [5], Technical Committee 211 of the Inter- 
national Organization for Standardization, ISO TC211 [6], 
and the IEEE Standards Association, IEEE-SA [7]. 

The Open Geospatial Consortium develops geospatial 
standards that are in widespread use within the geoscience 
community. Among the more commonly known standards 
and specification that the OGC has developed are: 

I CSW — Catalog Service for the Web 
> GML — Geography Markup Language 
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I SOS — Sensor Observation Service 

I SensorML — Sensor Model Language 

I W*S— Suite of Web Services 

I GeoSPARQL — for representation and querying of geo- 
spatial data for the Semantic Web. 

A recently established MOU between GRSS and OGC 
will enhance the cooperation and provide support to the 
GRSS Earth Science Informatics (ESI) Technical Committee. 
GRSS will provide support to the OGC Earth Systems Sci- 
ence (ESS) Domain Working Group (DWG), contributing to 
the ESS discussions based on GRSS developments and rec- 
ommending GRSS related presentations at OGC ESS meet- 
ings. GRSS and OGC agree to jointly support presentations, 
journal articles and other related outreach to highlight the 
applicability and benefits of geoscience interoperability. 
OGC and GRSS will work to involve other relevant stan- 
dards consortia and professional organizations in the devel- 
opment and advancement of geoscience interoperability. 

ISO/TC211 develops international standards for geo- 
graphic information, addressing the methods, tools and 
services for management and interoperability of geospatial 
data. Among the standards that are relevant to the routine 
activities of many GRSS members are: 

> ISO 19115— Metadata 

> ISO 19119 — Services 

I ISO 19130 — Imagery sensor models for geopositioning 

I ISO 19139 — Metadata — XML schema implementation 

» ISO 19157— Data Quality 

I ISO 19159 — Calibration and validation of remote sens- 
ing imagery sensors and data. 

Currently under development are standards for repre- 
senting concepts that support the interpretation of, and 
reasoning with geographic information (ISO 19150), and 
a common content model for imagery formats (ISO 19163). 

GRSS established a liaison relationship with ISO/TC211 
in 2004 and has since made regular presentations to its Ple- 
nary on GRSS' activities, and has had regular representa- 
tion on its projects and committees. 

The IEEE Standards Association facilitates standards 
development and standards related collaboration to 
advance global technologies. The IEEE-SA has overseen the 
development of many of standards that are at the heart of 
the information infrastructure. GRSS interfaces with the 
IEEE-SA through Standards Coordinating Committee 40 
(SCC40 — Earth Observations). 

V. OVERVIEW OF THE ESI TC 

As stated earlier, the mission of the ESI TC is to advance the 
application of informatics to the geosciences and remote 
sensing, and to provide a platform for ESI professionals to 
collaborate. The fields of interest of the ESI TC include, but 
are not limited to: 

I Data and information policies, stewardship, preserva- 
tion, provenance and quality 

I Knowledge representation, information models for the 
spatial and temporal relationships between entities in 


the Geosciences (e.g., spatial and process ontologies, 
vocabularies, semantic web) 

I Cyberinfrastructures, interoperability, standardization, 
web service, sensor web and cloud computing 

> Improving data discovery and access 

I Tools supporting spatial and temporal analyses and their 
applications including decision support systems, tools 
and systems to model the Earth system, tools to visualize 
and analyze geoscience data, information, and knowledge 

> Emerging information technologies trends and both 
their impact and applications in the geosciences. 

The ESI TC will be sponsoring two invited sessions at 
IGARSS 2014. The first session, titled "Implications of Big 
Data to Remote Sensing," will focus on evaluating different 
big data technologies that leverage a "shared nothing archi- 
tecture" and distributed file storage systems to support reli- 
able processing and analysis of satellite imagery. The second 
session is a joint ESI TC and OGC session titled "Advanc- 
ing Science through Management of the Geospatial Data 
Lifecycle". The focus of this session is to explore the role of 
standards at different stages of the data lifecycle. As science 
becomes more reliant on information technology, data stan- 
dards are as vital as uniform standards for weights and mea- 
sures. In addition to these special sessions, ESI TC will seek 
to sponsor either a TGRS or JSTARS special issue focusing on 
specific Earth Science Informatics topics. 

VI. CALL FOR PARTICIPATION 

As science and technology progress, the knowledge genera- 
tion lifecycle evolves, impacting everyone involved includ- 
ing the scientists and engineers who design and operate 
instruments, processing systems and numerical models, 
and acquire, validate, analyze, manage and interpret data. 
GRSS members are thus encouraged to engage with the ESI 
TC in its mission to bring together those GRSS members 
interested in advancing the field of informatics. Specific 
opportunities to contribute include serving as subject mat- 
ter experts in the development and/or review of standards, 
presenting ESI related research at IGARSS and submitting 
papers to the special issue of the Journals. To participate 
contact the ESITC chairs Dr. Ramachandran and Yue (rama- 
chr@uah.edu, geopyue@gmail.com) and join the IEEE 
GRSS Earth Science Informatics group on Linkedln [8]. 
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